Towards Balanced Defect Prediction with Better Information Propagation

نویسندگان

چکیده

Defect prediction, the task of predicting presence defects in source code artifacts, has broad application software development. prediction faces two major challenges, label scarcity, where only a small percentage artifacts are labeled, and data imbalance, majority labeled non-defective. Moreover, current defect methods ignore impact information propagation among this negligence leads to performance degradation. In paper, we propose DPCAG, novel model address above three issues. We treat as nodes graph, learn propagate influence neighboring iteratively an EM framework. DPCAG dynamically adjusts contributions each node selects high-confidence for augmentation. Experimental results on real-world benchmark datasets show that improves compare state-of-the-art models. particular, achieves substantial superiority when measured by Matthews Correlation Coefficient (MCC), metric is widely acknowledged be most suitable imbalanced data.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Towards Cross-Project Defect Prediction with Imbalanced Feature Sets

Cross-project defect prediction (CPDP) has been deemed as an emerging technology of software quality assurance, especially in new or inactive projects, and a few improved methods have been proposed to support better defect prediction. However, the regular CPDP always assumes that the features of training and test data are all identical. Hence, very little is known about whether the method for C...

متن کامل

Balanced mitochondria behave better

Chen et al. fi nd that evening out mitochondrial fusion and fi ssion allows the organelles to regain their normal function. Mutations in genes that control mitochondrial fusion or fission are responsible for various diseases, including dominant optic atrophy. In all of these diseases, either fusion or fi ssion is defective, and one poten...

متن کامل

Effects of Defect Propagation/Growth on In-Line Defect-Based Yield Prediction

This paper presents the importance of understanding defect propagation/growth and its impact on in-line yield prediction. In order to improve the prediction accuracy, impact of defect propagation and growth phenomena needs to be modeled and incorporated into yield prediction system. We developed a new yield prediction model by taking into account defect carryover. The empirical results of inter...

متن کامل

Is"Better Data"Better than"Better Data Miners"? (On the Benefits of Tuning SMOTE for Defect Prediction)

We report and fix an important systematic error in prior studies that ranked classifiers for software analytics. Those studies did not (a) assess classifiers on multiple criteria and they did not (b) study how variations in the data affect the results. Hence, this paper applies (a) multi-criteria tests while (b) fixing the weaker regions of the training data (using SMOTUNED, which is a self-tun...

متن کامل

Towards Better Understanding of Protein Secondary Structure: Extracting Prediction Rules

Although numerous computational techniques have been applied to predict protein secondary structure (PSS), only limited studies have dealt with discovery of logic rules underlying the prediction itself. Such rules offer interesting links between the prediction model and the underlying biology. In addition, they enhance interpretability of PSS prediction by providing a degree of transparency to ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence

سال: 2021

ISSN: ['2159-5399', '2374-3468']

DOI: https://doi.org/10.1609/aaai.v35i1.16157